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Input Set : A:\Omrfl821.app 
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<110> APPLICANT: Tang, Jordan J.N. 
Hong, Lin 
Ghosh, Arun K. 

<120> TITLE OF INVENTION: Inhibitors of Memapsin 2 and Use Thereof 
<130> FILE REFERENCE: OMRF 182 

<14G> CURRENT APPLICATION NUMBER: US/09/603,713 

<141> CURRENT FILING DATE: 2000-06-27 
<150> PRIOR APPLICATION NUMBER: 60/141,363 

1999-06-28 



60/177, 836 
2000-01-25 



Does Not Comply 
Corrected Diskette Needed 



<151> PRIOR FILING DATE 

<150> PRIOR APPLICATION NUMBER: 60/168,060 
<151> PRIOR FILING DATE: 1999-11-30 
<150> PRIOR APPLICATION NUMBER 
<151> PRIOR FILING DATE 
<150> PRIOR APPLICATION NUMBER: 60/178,368 
<151> PRIOR FILING DATE: 2000-01-27 
<150> PRIOR APPLICATION NUMBER: 60/210,292 
<151> PRIOR FILING DATE: 2000-06-08 
<160> NUMBER OF SEQ ID NOS : 31 
<170> SOFTWARE: Patent In Ver. 2.1 
<210> SEQ ID NO: 1 
<211> LENGTH: 3252 
<212> TYPE: DNA 
<213> ORGANISM: Homo sapiens 
<4 00> SEQUENCE: 1 

gcgggagtgc tgcctgccca cggcacccag cacggcatcc ggctgcccct gcgcagcggc 60 
ctggggggcg cccccctggg gctgcggctg ccccgggaga ccgacgaaga gcccgaggag 120 
cccggccgga ggggcagctt tgtggagatg gtggacaacc tgaggggcaa gtcggggcag 180 
ggctactacg tggagatgac cgtgggcagc cccccgcaga cgctcaacat cctggtggat 240 
acaggcagca gtaactttgc agtgggtgct gccccccacc ccttcctgca tcgctactac 300 
cagaggcagc tgtccagcac ataccgggac ctccggaagg gtgtgtatgt gccctacacc 360 
cagggcaagt gggaagggga gctgggcacc gacctggtaa gcatccccca tggccccaac 420 
gtcactgtgc gtgccaacat tgctgccatc actgaatcag acaagttctt catcaacggc 480 
tccaactggg aaggcatcct ggggctggcc tatgctgaga ttgccaggcc tgacgactcc 540 
ctggagcctt tctttgactc tctggtaaag cagacccacg ttcccaacct cttctccctg 600 
cagctttgtg gtgctggctt ccccctcaac cagtctgaag tgctggcctc tgtcggaggg 660 
agcatgatca ttggaggtat cgaccactcg ctgtacacag gcagtctctg gtatacaccc 720 
atccggcggg agtggtatta tgaggtgatc attgtgcggg tggagatcaa tggacaggat 780 
ctgaaaatgg actgcaagga gtacaactat gacaagagca ttgtggacag tggcaccacc 840 
aaccttcgtt tgcccaagaa agtgtttgaa gctgcagtca aatccatcaa ggcagcctcc 900 
tccacggaga agttccctga tggtttctgg ctaggagagc agctggtgtg ctggcaagca 960 
ggcaccaccc cttggaacat tttcccagtc atctcactct acctaatggg tgaggttacc 1020 
aaccagtcct tccgcatcac catccttccg cagcaatacc tgcggccagt ggaagatgtg 1080 
gccacgtccc aagacgactg ttacaagttt gccatctcac agtcatccac gggcactgtt 1140 
atgggagctg ttatcatgga gggcttctac gttgtctttg atcgggcccg aaaacgaatt 1200 
ggctttgctg tcagcgcttg ccatgtgcac gatgagttca ggacggcagc ggtggaaggc 1260 
ccttttgtca ccttggacat ggaagactgt ggctacaaca ttccacagac agatgagtca 1320 
accctcatga ccatagccta tgtcatggct gccatctgcg ccctcttcat gctgccactc 1380 



file://C:\Crf3\Outhold\VsrI603713.htm 



9/5/01 



Page 2 of 7 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/603,713 



DATE: 09/05/2001 
TIME: 17:36:21 



Input Set : A:\Onirfl821.app 

Output Set: N:\CRF3\09052001\l603713.raw 



62 tgcctcatgg tgtgtcagtg gcgctgcctc cgctgcctgc 

63 gctgatgaca tctccctgct gaagtgagga ggcccatggg 

64 ggaccacacc tccgtggttc actttggtca caagtaggag 

65 cagagcacct caggaccctc cccacccacc aaatgcctct 

66 ggctggcaag gtgggttcca gggactgtac ctgtaggaaa 

67 cactctgctg gcgggaatac tcttggtcac ctcaaattta 

68 tgaaacttca gccctgaacc tttgtccacc attcctttaa 

69 cttcttttct tagtttcaga agtactggca tcacacgcag 

70 tgtggtaccc tggcagagaa gagaccaagc ttgtttccct 

71 gaggatgcac agtttgctat ttgctttaga gacagggact 

72 ggtgcaaaga ttgcctcttg aattaaaaaa aaactagatt 

73 gcggctggaa agaggagaag gagagggagt acaaagacag 

74 aggaaaggca gaaacacaac cactcaccag tcctagtttt 

75 catcccatct cagaagatgg gtgttgtttt caatgttttc 

76 accaaaagtg agatgggaag ggcttatcta gccaaagagc 

77 tgaagtgccc actaagaagt tccacttaac acatgaattt 

78 tctctatctg aaccaccctt tattctacat atgataggca 

79 cctaagctcc aggtgccctg tgggagagca actggactat 

80 tcctggtcat aggctcactc tttcccccaa atcttcctct 

81 ctaaaaggaa taggtaggag acctcttcta tctaatcctt 

82 tcattcaaca gctgatgccc tataacccct gcctggattt 

83 agtagcaaga tctttacata attcagagtg gtttcattgc 

84 cccctccatt tatttgacta aagcatcrca cagtggcact 

85 agaaatacag tgctttatgg ctctaacatt actgccttca 

86 aaggatggca gcctcagggc ttccttatgt cctccaccac 

87 catctttttc ccctatcctg ttcttcccct ccccgctcct 

88 gctggttctt gggctaggta gtggggacca agttcattac 

89 agtaaactac ggtaccagtg ttagtgggaa gagctgggtt 

90 tcctactcct acctggtcaa cccgctgctt ccaggtatgg 

91 acctgataag ggagagggaa atacaaggag ggcctctggt 

92 gcccmcaagc cataaaccaa taaamcaaga atactgagtc 

93 aaaaaaaaaa aa 

96 <210> SEQ ID NO: 2 

97 <211> LENGTH: 488 

98 <212> TYPE: PRT 

99 <213> ORGANISM: Homo sapiens 

101 <220> FEATURE: 

102 <223> OTHER INFORMATION: Purified Memapsin 2 

104 <220> FEATURE: 

105 <223> OTHER INFORMATION: Amino Acids 28-48 are remnant putative propeptide 



gccagcagca 
cagaagatag 
acacagatgg 
gccttgatgg 
cagaaaagag 
agtcgggaaa 
attctccaac 
gttaccttgg 
gctggccaaa 
g cataaacaa 
gactatttat 
ggaatagtgg 
agacctcatc 
ttttctgtgg 
tcttttttag 
ctgccatatt 
gcactgaaat 
agcagggctg 
ggagctttgc 
aaaagcataa 
cttcctatta 
cttcctaccc 
agcattatac 
gtatcaaggc 
aagagctcct 
aatggtacgt 
ctccctatca 
ttcctagtat 
gacctgctaa 
gttcctggcc 
taaaaaaaaa 



tgatgacttt 
agattcccct 
cacctgtggc 
agaaggaaaa 
aagaaagaag 
ttctgctgct 
ccaaagtatt 
cgtgtgtccc 
gtcagtagga 
gcctaacatt 
acaaatgggg 
gatcaaagct 
tccaagatag 
ttgcagcctg 
ctctcttaaa 
aatttcattg 
atcctaaccc 
ggctctgtct 
agccaaggtg 
tgttgaacat 
ggctataaga 
tctctaatgg 
caagagtatg 
tgcctggaga 
tgatgaaggt 
gggtacccag 
gttctagcat 
acccactgca 
gtgtggaatt 
tcagccagct 
aaaaaaaaaa 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3252 



106 
108 



<220> 



residues 
FEATURE : 



109 <223> OTHER INFORMATION: Amino Acids 58-61, 78, 80, 82-83, 116, 118-121, 

1, 283, and 
the OM99-2 



110 156, 166, 174, 246, 274, 276, 278-28] 

111 376-377 are residues in contact with 

112 inhibitor 

114 <220> FEATURE: 

115 <223> OTHER INFORMATION: Amino acids 54-57, 61-68, 73-80, 86-89, 109-111, 

116 113-118, 123-134, 143-154, 165-168, 198-202, and 
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117 220-224 are N-lobe Beta Strands 

119 <220> FEATURE: 

120 <223> OTHER INFORMATION: Amino Acids 184-191 and 210-217 are N-lobe Helices 

122 <220> FEATURE: 

123 <223> OTHER INFORMATION: Amino acids 237-240, 247-249, 251-256, 259-260, 

124 273-275, 282-285, 316-318, 331-336, 342-348, 

125 354-357, 366-370, 372-375, 380-383, 390-395, 

126 400-405, and 418-420 are C-lobe Beta Strands 

128 <220> FEATURE: 

129 <223> OTHER INFORMATION: Amino Acids 286-299, 307-310, 350-353, 384-387, 

130 and 427-431 are C-lobe Helices 
132 <400> SEQUENCE: 2 



133 


Ala 


Glv 


Val 


Leu 


Pro 


Ala 


His 


Glv 

\j J. y 


Thr 


Gin 


His 


Glv 

N-l -1. Jf 


He 


Arg 


Leu 


Pro 


134 


1 








5 










10 










15 




136 


Leu 


Arg 


Ser 


Glv 


Leu 


Glv 


Glv 


Ala 


Pro 


Leu 


Glv 


Leu 


Arq 


Leu 


Pro 


Arq 


137 








20 










25 










30 






139 


Glu 


Thr 


Asp 


Glu 


Glu 


Pro 


Glu 


Glu 


Pro 


Glv 


Arq 


Arq 


Glv 


Ser 


Phe 


Val 


140 






35 










40 










45 








142 


Glu 


Met 


Val 


Asp 


Asn 


Leu 


Arg 


Glv 


Lys 


Ser 


Glv 

VJ -1- Jf 


Gin 


Glv 


Tvr 


Tvr 


Val 


143 




50 










55 










60 










145 


Glu 


Met 


Thr 


Val 


Glv 
J. J 


Ser 


Pro 


Pro 


Gin 


Thr 


Leu 


Asn 


He 


Leu 


Val 


Asp 


146 


65 










70 










75 










80 


148 


Thr 


Gly 


Ser 


Ser 


Asn 


Phe 


Ala 


Val 


Glv 


Ala 


Ala 


Pro 


His 


Pro 


Phe 


Leu 


149 










85 










90 










95 




151 


His 


Arg 


Tvr 


Tvr 


Gin 


Arg 


Gin 


Leu 


Ser 


Ser 


Thr 


Tvr 


Arg 


Asp 


Leu 


Arq 


152 








100 










105 










110 






154 


Lys 


Gly 


Val 


Tyr 


Val 


Pro 


Tyr 


Thr 


Gin 


Gly 


Lys 


Trp 


Glu 


Gly 


Glu 


Leu 


155 






115 










120 










125 








157 


Gly 


Thr 


Asp 


Leu 


Val 


Ser 


He 


Pro 


His 


Gly 


Pro 


Asn 


Val 


Thr 


Val 


Arg 


158 




130 










135 










140 










160 


Ala 


Asn 


He 


Ala 


Ala 


He 


Thr 


Glu 


Ser 


Asp 


Lys 


Phe 


Phe 


He 


Asn 


Gly 


161 


145 










150 










155 










160 


163 


Ser 


Asn 


Trp 


Glu 


Gly 


He 


Leu 


Gly 


Leu 


Ala 


Tyr 


Ala 


Glu 


He 


Ala 


Arg 


164 










165 










170 










175 




166 


Pro 


Asp 


Asp 


Ser 


Leu 


Glu 


Pro 


Phe 


Phe 


Asp 


Ser 


Leu 


Val 


Lys 


Gin 


Thr 


167 








180 










185 










190 






169 


His 


Val 


Pro 


Asn 


Leu 


Phe 


Ser 


Leu 


Gin 


Leu 


Cys 


Gly 


Ala 


Gly 


Phe 


Pro 


170 






195 










200 










2a5 








172 


Leu 


Asn 


Gin 


Ser 


Glu 


Val 


Leu 


Ala 


Ser 


Val 


Gly 


Gly 


Ser 


Met 


He 


He 


173 




210 










215 










220 










175 


Gly 


Gly 


He 


Asp 


His 


Ser 


Leu 


Tyr 


Thr 


Gly 


Ser 


Leu 


Trp 


Tyr 


Thr 


Pro 


176 


225 










230 










235 










240 


178 


He 


Arg 


Arg 


Glu 


Trp 


Tyr 


Tyr 


Glu 


Val 


He 


He 


Val 


Arg 


Val 


Glu 


He 


179 










245 










250 










255 




181 


Asn 


Gly 


Gin 


Asp 


Leu 


Lys 


Met 


Asp 


Cys 


Lys 


Glu 


Tyr 


Asn 


Tyr 


Asp 


Lys 


182 








260 










265 










270 






184 


Ser 


He 


Val 


Asp 


Ser 


Gly 


Thr 


Thr 


Asn 


Leu 


Arg 


Leu 


Pro 


Lys 


Lys 


Val 


185 






275 










280 










285 








187 


Phe 


Glu 


Ala 


Ala 


Val 


Lys 


Ser 


He 


Lys 


Ala 


Ala 


Ser 


Ser 


Thr 


Glu 


Lys 
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188 290 295 300 



190 


Phe 


Pro 


Asp 


Gly 


Phe 


Trp 


Leu 


Gly 


Glu 


Gin 


Leu 


Val 


Cys 


Trp 


Gin 


Aia 


191 


305 










310 










315 










320 


193 


Glv 


Thr 


Thr 


Pro 


Trp 


Asn 


He 


Phe 


Pro 


Val 


He 


Ser 


Leu 


Tvr 


Leu 


Met 


194 










325 










330 










335 




196 


Glv 


Glu 


Val 


Thr 


Asn 


Gin 


Ser 


Phe 


Ara 


He 


Thr 


He 


Leu 


Pro 


Gin 


Gin 


197 








340 










345 










350 






199 


Tvr 


Leu 


Arg 


Pro 


Val 


Glu 


Asp 

XT 


Val 


Ala 


Thr 


Ser 


Gin 


Asp 


Asp 


Cys 


Tyr 


200 






355 










360 










365 








202 


Lys 


Phe 


Ala 


He 


Ser 


Gin 


Ser 


Ser 


Thr 


Gly 


Thr 


Val 


Met 


Gly 


Aia 


Vai 


203 




370 










375 










380 










205 


He 


Met 


Glu 


Gly 


Phe 


Tyr 


Val 


Val 


Phe 


Asp 


Arg 


Ala 


Arg 


Lys 


Arg 


He 


206 


385 










390 










395 










400 


208 


Gly 


Phe 


Ala 


Val 


Ser 


Aia 


Cys 


His 


Val 


His 


Asp 


Glu 


Phe 


Arg 


Thr 


Aia 


209 










405 










410 










415 




211 


Ala 


Val 


Glu 


Gly 


Pro 


Phe 


Val 


Thr 


Leu 


Asp 


Met 


Glu 


Asp 


Cys 


Gly 


Tyr 


212 








420 










425 










430 






214 


Asn 


He 


Pro 


Gin 


Thr 


Asp 


Glu 


Ser 


Thr 


Leu 


Met 


Thr 


He 


Aia 


Tyr 


Vai 


215 






435 










440 










445 








217 


Met 


Ala 


Ala 


He 


Cys 


Aia 


Leu 


Phe 


Met 


Leu 


Pro 


Leu 


Cys 


Leu 


Met 


Val 


218 




450 










455 










460 










220 


Cys 


Gin 


Trp 


Arg 


Cys 


Leu 


Arg 


Cys 


Leu 


Arg 


Gin 


Gin 


His 


Asp 


Asp 


Phe 


221 


465 










470 










475 










480 


223 


Ala 


Asp 


Asp 


He 


Ser 


Leu 


Leu 


Lys 



















224 485 

227 <210> SEQ ID NO: 3 

228 <211> LENGTH: 503 

229 <212> TYPE: PRT 

230 <213> ORGANISM: Homo sapiens 

232 <220> FEATURE: 

233 <223> OTHER INFORMATION: Pro-memapsin 2 

235 <220> FEATURE: 

236 <223> OTHER INFORMATION: Amino Acids 1-15 are vector-derived residues 

238 <220> FEATURE: 

239 <223> OTHER INFORMATION: Amino Acids 16-64 are a putative pro peptide 

241 <220> FEATURE: 

242 <223> OTHER INFORMATION: Amino Acids 1-13 are the T7 promoter 

244 <220> FEATURE: 

245 <223> OTHER INFORMATION: Amino Acids 16-456 are Pro-memapsin 2-Tl 
24 7 <220> FEATURE: 

248 <223> OTHER INFORMATION: Amino Acids 16-421 are Promemapsin 2-T2 

250 <400> SEQUENCE: 3 

251 Met Aia Ser Met Thr Gly Gly Gin Gin Met Gly Arg Gly Ser Met Ala 

252 15 10 15 

254 Gly Vai Leu Pro Aia His Gly Thr Gin His Gly He Arg Leu Pro Leu 

255 20 25 30 

257 Arg Ser Gly Leu Gly Gly Aia Pro Leu Gly Leu Arg Leu Pro Arg Glu 

258 35 40 45 

260 Thr Asp Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Vai Glu 
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261 




50 










55 




263 


Met 


Val 


ASD 


Asn 


Leu 


Arq 


Gly 


Lvs 


264 


65 










70 






266 


Met 


Thr 


Val 


Glv 


Ser 


Pro 


Pro 


Gin 


267 










85 








269 


Glv 


Ser 


Ser 


Asn 


Phe 


Ala 


Val 


Glv 

W -1- Jf 


270 








100 










272 


Arg 


Tvr 


Tvr 


Gin 


Ara 


Gin 


Leu 


Ser 


273 






115 










120 


275 


Glv 


Val 


Tvr 

a. y J. 


Val 


Pro 


Tvr 


Thr 


Gin 


276 




130 










X O j 




278 


Thr 


Asp 


Leu 


Val 


Ser 


He 


Pro 


His 


279 


145 










150 






281 


Asn 


lie 


Ala 


Ala 


He 


Thr 


Glu 


Ser 


282 










165 








284 


Asn 


TrD 


Glu 


Glv 


He 


Leu 


Glv 


Leu 


285 








180 










287 


Asp 


Asp 


Ser 


Leu 


Glu 


Pro 


Phe 


Phe 


288 






195 










200 


290 


Val 


Pro 


Asn 


Leu 


Phe 


Ser 


Leu 


Gin 


291 




210 










215 




293 


Asn 


Gin 


Ser 


Glu 


Val 


Leu ' 


Ala 


Ser 


294 


225 










230 






296 


Glv 


lie 


Asp 


His 


Ser 


Leu 


Tvr 


Thr 


297 










245 








299 


Arg 


Arg 


Glu 


Tro 


Tvr 


Tvr 


Glu 


Val 


300 








260 










302 


Glv 


Gin 


Asp 


Leu 


Lys 


Met 


Asp 


Cys 


303 






275 










.280 


305 


lie 


Val 


Asp 


Ser 


Glv 


Thr 


Thr 


Asn 


306 




290 










295 




308 


Glu 


Ala 


Ala 


Val 


Lys 


Ser 


He 


Lys 


309 


305 










310 






311 


Pro 


Asp 


Gly 


Phe 


Trn 


Leu 


Gly 


Glu 


312 










325 








314 


Thr 


Thr 


Pro 


Tro 


Asn 


He 


Phe 


Pro 


315 








340 










J J. / 


va J. Li 


V a. -L 


L 111. 




r;i n 


O C i- 


Php 


Arg 


318 






355 










360 


320 


Leu 


Arg 


Pro 


Val 


Glu 


Asp 


Val 


Ala 


321 




370 










375 




323 


Phe 


Ala 


He 


Ser 


Gin 


Ser 


Ser 


Thr 


324 


385 










390 






326 


Met 


Glu 


Gly 


Phe 


Tyr 


Val 


Val 


Phe 


327 










405 








329 


Phe 


Ala 


Val 


Ser 


Ala 


Cys 


His 


Val 


330 








420 










332 


Val 


Glu 


Gly 


Pro 


Phe 


Val 


Thr 


Leu 


333 






435 










440 
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60 



Ser 


Gly 


Gin 


Gly 


Tyr 


Tyr 


Val 


Glu 






75 










80 


Thr 


Leu 


Asn 


He 


Leu 


Val 


Asp 


Thr 




90 










95 




Ala 


Ala 


Pro 


His 


Pro 


Phe 


Leu 


His 


105 










110 






Ser 


Thr 


Tvr 


Arq 


Asp 


Leu 


Arg 


Lys 










125 








Glv 


Lvs 


Trp 


Glu 


Gly 


Glu 


Leu 


Gly 








140 










Glv 


Pro 


Asn 


Val 


Thr 


Val 


Arq 


Ala 






155 










160 


Asp 


Lys 


Phe 


Phe 


He 


Asn 


Gly 


Ser 




170 










175 




Ala 


Tvr 


Ala 


Glu 


He 


Ala 


Arg 


Pro 


185 










190 






Asp 


Ser 


Leu 


Val 


Lys 


Gin 


Thr 


His 










205 








Leu 


Cys 


Gly 


Ala 


Glv 


Phe 


Pro 


Leu 








220 










Val 


Glv 


Glv 


Ser 


Met 


He 


He 


Gly 






235 










240 


Glv 


Ser 


Leu 


Tro 


Tvr 


Thr 


Pro 


He 




250 










255 




He 


He 


Val 


Arg 


Val 


Glu 


He 


Asn 


265 










270 






Lys 


Glu 


Tvr 


Asn 


Tvr 
J. jf a- 


Asp 


Lvs 


Ser 










285 








Leu 


Arg 


Leu 


Pro 


Lvs 


Lvs 


Val 


Phe 








300 










Ala 


Ala 


Ser 


Ser 


Thr 
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VERIFICATION SUMMARY r..™ 

PATENT .PP.IC.;^"'' „S/CVS03,,U ?S; T^l'.'^f' 

Input Set : A:\Omrfl821.app 

Output Set: N:\CRF3\09052001\1603713.raw 

'jffjr^ot'f ^^r.^"^^^"^ Application Nu^er 
L:580 M:258 W: MandatorJ Feature ITss^' till Tnl ^''"h 1°' '"^^'^ 
L:580 M:341 W: (46) "n" or "Xa^" T.l^ V III f ID#:22 
t.c;qt m -,.= 0 used, for SEQ ID#:22 

L:597 M:258 W: Mandatory Feature missina <??1> ^ ^ . 

L:597 M:258 W: Mandator^ Feature Mss^nq' 022^ no. ^^'^ = 

L:597 M:341 W: (46) "n" or "yTJ' "'^^^'•''f' <222> not found for SEQ ID#:23 
L-707 M.25q W All h v. ^^# = 23 

L.707 M.259 W. Allowed number of lines exceeded, <223> Other Information: 
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